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[57] ABSTRACT 

A system integrates data elements from an operational 
database of an Internet Service Provider (ISP) and Com- 
merce Service Provider (CSP) into a predetermined format 
for supporting collection of the Internet and/or electronic 
commerce data. The system includes a database storing the 
electronic commerce data for interrogation by the ISP/CSP, 
a user station, and a design database. The design database 
includes a logical data model providing a description of the 
database faciUtating the integration of the Internet and/or 
electronic commerce data and fecilitating query and report 
access of the database. The design database also includes a 
formatted file library providing classifications including at 
least one of process characterizations, customer 
descriptions, preference determinations, and behavior 
patterns, the classifications being reusable for at least one of 
different technical processes and different customer prob- 
lems. An optional a user interface is also provided which is 
capable of providing the functions of query access and 
report access of the design database. 
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FIG. 15A 
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FIG. 19 
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FIG. 20 



Business Operation Manag 


ement - Subscriber Services Metrics 


Table Name 


Table Tvoe 


Description 


Service^Subscriber 


Fad 


Facilitates tracking the number of subscribers 
for each dial-up or dedicated line service over a 
range of time periods. Enables management to 
track trends in the customer base as a result of 
business pradices or competitive influences. 


Service^Rate 


Fad 


Facilitates tracking billing rates for each dial-up 
or dedicated line sen/ice over a range of time 
periods. Enables billed revenue to be calculated 
hA^pd nn r^tp^ pffpdivp at th6 niuen time 

Important for comparing revenue over various 
periods of time. 


Billed^Revenue 


Fad 


Trfl^*lrc inrliulHi lal hillinn tranQariinnc fnr fiJich 

liClvl\>> lilUIVIvJUCll Ullllliy llQMOClvliUIIO IV/l CCXvil 

subscriber at the time of billing. Used to calculate 
total billed revenue on a per subscriber basis for 
any point in time for calculating profitability 
indices. E.g. Most/least profitable customers. 


Subscriber.Status 


Fad 


Indicates the subscriber's status (e.g. adive, 
tenninated) over the time dimension. 


SubscribecAddr_Status 


Subset 


1 IqpH fnr Qiihorrihpr aHHrp^^ rhannp^ nvpr time 

The subscriber record contains the cunrent 
address. This record indicates past addresses. 


Datejime 


Dimension 


Allows queries from various perspectives across 
the time dimension Currentiv the oranularitv for 
this feature set is one month. 


Billing.Master 


Dimension 


Describes the tvoes of transadions for which 
bills are rendered. Enables management to 
track revenue generated for each type of 

nmHi ip4 snH con/if^p 
prUUUl^l dliU oci viUc. 


Service^Description 


Dimension 


Describes the attributes of each dial-up or 
dedicated access service. Enables 
management to track revenue and subscribers 
for each type of sen/ice over time. 


Subscriber 


Dimension 


Identifies and describes the person or entity who 
subscribes to the sen/ices offered by the CSP. 


SubscribBLServices 


Subset 


Allows for subscribers that have more than one 
senrioe. 
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COLLECTION AND INTEGRATION OF 
INTERNET AND ELECTRONIC COMMERCE 
DATA IN A DATABASE DURING WEB 
BROWSING 

BACKGROUND OF THE INVENTION 

1. Field of the Inveotion 

The present inveotion is generally related to a computer 
architecture and method for integrating data elements from 
Internet service providers (ISP) and commerce service pro- 
viders (CSP) operational database(s) into predetermined 
format for supporting collection of Internet and/or electronic 
commerce data, and more particularly, to a computer archi- 
tecture and method for integrating data elements from 
commerce service providers *s operational database(s) into 
predetermined format for supporting collection of Internet 
and/or electronic commerce data over or from the World 
Wide Web for ISPs and CSPs. 

2. Badcgroimd of the Related Art 

More and more people are using the Internet as a method 
of communicating, advertising and shopping for and pur- 
chasing goods. The sale of Internet services is growing at an 
amazing rate. The number of projected users into the year 
2000 is dramatically impacting the communicatioos industry 
both from the standpoint of an opportunity to realize new 
business aixl as a concern due to the potential loss of 
traditional revenue sources. FIG. 1 illustrates this dramatic 
increase sales in World Wide Web (WWW or web) servers, 
a simimary of which is presented below: 





WORLD WIDE WEB SERVER SALES FORECAST 




1995 


1996 


1997 


1998 1999 


Intranet 
Interact 
Total 


475 
621 
1,096 


2,673 

979 

3,652 


5,483 
1,410 
6,893 


9,210 13433 
1.777 2,159 
10,987 15,292 



The explosive growth in PCs, Servers and Internet related 
software has cultivated a need for companies to better 
understand their customer's needs. To better understand 
these needs, many gigabytes of data must be collected and 
anal)^ed to arrive at the best way to service the customer. 

Market and industry analysts alike, believe that the Inter- 
net will prove to be tbe most significant innovation in 
modem history ance the light bulb and automobile. The 
method in which we perform daily business operations will 
be changed forever due to this new technology. Many 
technology based companies in the computer industry are 
scrambling to outline new products and services using and 
e3q)loiting the Internet as a vehicle to increase market share 
and revenue, while increasing productivity and cutting 
operational costs. FIG. 2 is an illustration of the estimated 
growth in web users over the next several years. 

Id an efifort to meet tbe above needs of digesting tbe vast 
amounts of information on the web, companies have 
designed many browsers and millions of web pages to 
access, retrieve and utilize this information. In addition to 
the Internet, companies have set up local "intranets'* for 
storing and accessing data for running their organizations. 
However, the sheer amount of available information is 
posing increasingly more difficult challenges to conven- 
tional approaches. 

A major difficulty to overcome is that information con- 
tained on the web or web pages are often dispersed across 
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the network al many sites. It is often time-consuming for a 
user to visit all these sites. One conventional approach used 
to access this information more effectively is called a search 
engine. A search engine is actually a set of programs 

5 accessible at a network site within a networic, for example a 
local area network (LAN) at a company or the Internet and 
World Wide Web. One program, called a "robot" or "spider," 
pre-traverses a networic in search of documents and builds 
large index files of keywords found in the documents. 

10 A user of the search engine formulates a query comprising 
one or more keywords and submits the query to another 
program of the search engine. In response, the search engine 
inspects its own index files and di^lays a list of documents 
that match the search query, typically as hyperlinks. When 

j5 a user activates one of the hyperlinks to see the information 
contained in the document, the user exits the site of the 
search engine aod terminates tbe search process. 

Search engines, however, have their drawbacks. For 
example, a search engine is oriented to discovering texmal 

20 information only. In particular, they are not well-suited to 
indexing information contained in structured databases, e.g. 
relational databases, voice related information, audio related 
information, and the like. Moreover, mixing data from 
incompatible data sources is difficult in conventional search 

25 engines. 

Another disadvantage with conventional search engines is 
that irrelevant information is aggregated with relevant infor- 
mation. For example, it is not uncommon for a search engine 
on the web to locate hundreds of thousands of documents in 

30 response to a single query. Many of those documents are 
found because they coincidenlally include the same keyword 
in the search query. Sifting through search results in the 
thousands, however, is a daunting task. 
Accordingly, we have determined that there is a need to 

35 be able to effectively collect the data, store the data and/or 
provide useful marketing information indicative of events 
occurring on the web. For example, data which indicates 
^ere a user has been in prior sessions may be useful in 
designing future products accessible via the web. We have 

40 also determined that there is a need for an architecture and 
method used to support and analyze Internet and/or elec- 
tronic commerce data over or firom the World Wide Web for 
CSP& and ISPs. 
We have further determined that there is the need for an 

45 architecture and me±od used to correlate user, application, 
and access functions, and to store Internet and/or electronic 
coomierce data in an efficient and multi-dimensional 
method. Wb have also determined that there is a need to 
provide a tool set that can easily communicate with, or 

50 become a subset of, an existing scaleable data warehouse to 
provide lotemet and/or electronic commerce marketing 
decision support. Unfortunately, conventional architectures 
and/or techniques are unable to otganize and present this 
information in an efficient manner. 

SUMMARY OF TOE INVENTION 
To overcome the above-detailed disadvantages and short- 
comings of the prior art, it is a feature and advantage of the 
present invention to effectively collect the data, efficiently 
60 store the data and/or provide useful maiketiiig information 
indicative of events occurring on the web. It is another 
feature and advantage of the present invention to collect data 
which indicates where a user has been in prior sessions, and 
which may be useful in designing future products accessible 
65 via and for the web. 

It is another feature and advantage of the present inven- 
tion to provide an architecture and method used to support 
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and analyze InterDet and/or electronic commerce data over 
or from the World Wide Web for CSPs/ISPs. 

It is another feature and advantage of the present inven- 
tion to provide an architecture and method used to correlate 
user, application, and access functions and then to correlate 
these functions with business operational functions. 

It is another feature and advantage of the present inven- 
tion to provide a tool set that can easily communicate with, 
or become a subset of, an existing scaleable data warehouse 
to provide Internet and/or electronic comooerce marketing 
decision support. 

Id the attainment of the same, the present invention 
provides an architecture and method of supporting, 
collecting, storing, and analyzing Internet and/or electronic 
commerce data over or from the World Wide Web for ISPs 
and CSPs. The present invention also includes one or more 
computer systems employing the method, a computer read- 
able memory storing the medium, and a method £Dr storing 
the synchronization i^cess in memory and in a storage 
device. 

The present invention combines improved methods of 
data collection, translation and storage with enhanced analy- 
sis techniques to e3q}loit the vast amount of information 
available through emerging uses of the web, and other 
internet-based systems. The results can be used as part of a 
customer's decision support system to allow better (e.g., 
more statistically valid) business decisions to be made faster. 

The present invention concentrates on opportunities 
within, for example, the ISP sector of the telecommunica- 
tions industry, and provides ISRs the insight into their 
customer's services usage, demographics, and behavior pat- 
terns. Through this insight, ISPs should be able to support 
their decisions in such areas as customer retention, service 
pricing and profitability, and target marketing campaign 
using analytical methods based on customer and market 
data. Furthermore, ISPs can provide the knowledge and 
capabilities to the hosted business customers in order for 
those customers to gain similar insight into their users to 
make effective and customer-driven business decisions. 

In addition, the present invention is complementary with 
the current Internet Services Delivery Platform program. 
One environment where the present invention operates is 
where the consumer or business customer is able to utilize 
the CSP*s/ISP's web-based resoxirces through a combination 
of access and backbone networks which, in mm, are con- 
nected to, for example, the global Internet to interface with 
the rest of the internet resources. The access networks 
consist of many Point of Presences (POPs) which provide 
dial-up (e.g., analog/digital, ISDN) access service or dedi- 
cated digital (e.g.. Frame Relay, ISDN) access senrice using 
either modem pool technologies and/or access server/router 
for the customers 

The access networks also provide Dynamic Host Com- 
munication Protocol (DHCP) services, a Ptoxy Cache 
Server, and a firewall to facilitate the service administration, 
address translation, security authentication and/or 
authorization, and web operation efficiency. The backbone 
networks^ consisting of switches and routecs, optimize the 
transport between the access networiss and the Internet 
Service Center. Internet Service Centers may provide Web 
Hosting Service and Electronic Commerce Applications. 

The present invention provides an architecture supporting 
value-added services to improve the operation and services 
of ISPs/CSPs. The present invention leverages data ware- 
house and data mining techniques augmented with network- 
ing elements and web technologies. The goal of the present 
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invention is to provide the customers with dynamic, flexible, 
and adaptable interfaces and systems to fully utilize the 
interactivity and the tracking of the interactivity of the new 
Web medium. As a result, decision makers will be able to 

5 access decision support information through a secure net- 
work in a simple (ease of use), timely (fast enough), usable 
(valuable) and personalized (customized) fashion. 
Furthermore, this information will allow the customers to 
make intelligence decisions and take timely actions to 

10 achieve their business goals. 

The present invention also is a decision support solution 
targeted at an Internet Service Provider (ISP) and Commerce 
Service Provider (CSP). CSPs and ISPs are service provid- 
ers who offers a ^1 range of value-added services such as 

15 electronic commerce applications. Its purpose is to help 
ISPs/CSPs to better manage their business, customer base, 
and services by leveraging areas of expertise in data ware- 
housing and data retrieval. 
The present invention was designed in view of the fol- 

20 lowing goals which we have determined: 

1. Understand electronic commerce over the Internet. 

2. Understand the dynamics of the ISP/CSP business. 

3. Determine the type of decision support needed to 
effectively run an Internet-based service business. 

4. Understand how to leverage data warehouse and data 
storage and retrieval assets into this environment 

The present invention is a data warehouse— data storage 
and retrieval solution that is tailored to internet-based appli- 

^ cations and services. One of its primary purposes is to allow 
ISP/CSP decision-makers to exploit the intelligence con- 
tained in datasets derived from the following environments: 
ISP/CSP business operations. 
Internet or Web-based applications. 

35 Web-based commerce applications. 

Additionally, the decision-support information derived 
for specific apphcations can be provided by ISPs/CSPs to 
their electronic commerce clients as a service. The "discov- 
ery" of knowledge firom each dataset is beneficial to ISPs/ 

40 CSPs and their customers for the deep insights that it 
provides into business dynamics and customer behavior. 

The scope of WebMining includes a data warehouse that 
extracts data from web logs as well as various business 
operational data bases. The warehouse, in tum, sources three 

45 datamarts: one focused on business operations, one on the 
CSPs customers and one on electronic commerce applica- 
tions and on-line content. On-line content, in this context 
means, applications provided by the CSP as a service to 
his/her subscribers. The integration of web data to business 

50 data is key. 

WebWaiehouse and datamarts provide information in a 
business context, not a web site context. This is an important 
distinction. Although the system logs from one source of 
input to the warehouse, the data extracted from them is 

55 refined and translated into a business context before it is 
correlated with other operational data (customer, billing, 
product, etc.) and put into the warehouse. In other words, the 
warehoused data is refined to represent products being sold 
through electronic commerce applications^ advertisements 

60 being seen by customers, specific content being viewed by 
customers, etc., not URLs. The extraction utilities and 
translation libraries that provide content for the warehouse 
will be as important to our offering as the decision support 
functions at the back end. 

65 The systems elements of the present invention include: 
1. mapping module: systems to collect, interpret, analyze, 
translate, refine and correlate customer Internet usage. 
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Web/Intemet application data, electronic commerce 
application data and other operational data. 

2. storage module: Scaleable Data Warehouse to structure, 
organize, store and manage the data. 

3. data viisualization: business dedsion^pport solutions 5 
based on knowledge analysis and discovery of the data 
contained in the storage module. 

The benefits of the present invention include: 

1. providing decision support and adding operational 
information for the ISP/CSP to enable new/improved lo 
value-added services, allow the implementation of 
equitable and value-based pricing, actdeve better qual- 
ity of service, manage capacity, and add bill-back 
capabilities for charge bade scenarios. 

2. providing to the ISP's/CSP's customers valuable infor- 15 
mation such as decision support for proactive targeted 
marketing, usage and customer preferences feedback 
on the web applications, usage information for business 
access, direct decision support access for specific ISP/ 
CSP customers' applications. 20 

The WebWarehouse designer of the present invention 
provides a core design that is tailored for the decision- 
support functions. Extensions and/or changes to this design, 
however, may be optionally required to accommodate the 
ISP's/CSP's operational data and, perhaps, any additional 25 
decision-support requirements that surfaced during the data 
discovery phase. WebWarehouse designer contains two 
main components: a logical data model and a formatted 
library. The logical data model provides overall descriptions 
of the WebWarehouse. This description determines how well 30 
the WebWarehouse can integrate additional datasets and 
how effective the access of WebWarehouse will be for 
queries and reports. The Formatted File Library provides the 
intermediate classifications such as process 
characterizations, customer descriptions, preference 35 
determination, and behavior patterns. These classifications 
can beneficially be reused in different technical process and 
different customer problems. 

In accordance with one embodiment of the invention, a 
system integrates data elements from an operational data- 40 
base of an Internet Service Provider (ISP) and Commerce 
Service Provider (CSP) into a predetermined format for 
supporting collection of the Internal and/or electronic com- 
merce data. The system includes a database storing the 
Internet and/or electronic commerce data for interrogation 45 
by the ISP/CSP, and a user station inchiding a design 
database. The design database includes a logical data model 
providing a description of the database fiicilitattng the 
integration of the Internet and/or electronic commerce and 
operational data and facilitating query and report access of 50 
the database. The design database also includes a formatted 
file library providing clasdfications including at least one of 
process characterizations, customer descriptions, preference 
determinations, and behavior patterns^ fiie classifications 
being reusable for at least one of different technical pro- S5 
cesses and different customer problems. As optional, a user 
interface is also provided which is capable of providing the 
functions of query access and report access of the design 
database. 

One or more computer memory products are also pro- 60 
vided. One or more computers memory stores, and one or 
more computers executes a program including the above 
process steps. 

There has thus been outlined, rather broadly, the more 
important features of the invention in order that the detailed 65 
description thereof that folbws may be better understood, 
and in order that the present contribution to the art may be 
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better appreciated. There are, of course, additional features 
of the invention that will be described hereinafter and which 
will form the subject matter of the claims appended hereto. 

In this respect, before explaining at least one embodiment 
of the invention in detail, it is to be understood that the 
invention is not limited in its application to the details of 
construction and to the arrangements of the components set 
forth in the following description or illustrated in the draw- 
ings. The invention is capable of other embodiments and of 
being practiced and carried out in various ways. Also, it is 
to be understood that the phraseology and terminology 
employed herein are for the purpose of description and 
should not be regarded as limiting. 

As such, those skilled in the art will appreciate that the 
conception, upon which this disclosure is based, may readily 
be utilized as a basis for the designing of other structures, 
methods and systems for carrying out the several puiposes 
of the present invention. It is important, therefore, that the 
claims be regarded as including such equivalent construc- 
tions insofar as they do not depart from the spirit and scope 
of the present invention. 

Further, the purpose of the foregping abstract is to enable 
the U.S. Patent and Trademark Office and the public 
generally, and especially the scientists, engineers and prac- 
titioners in the art who arc not familiar with patent or legal 
terms or phraseology, to determine quickly from a cursory 
inspection the nature and essence of the technical disclosure 
of the application. The abstract is neither intended to define 
the invention of the application, which is measured by the 
claims, nor is it intended to be limiting as to the scope of the 
invention in any way. 

The above objeas of the invention, together with other 
apparent objects of the invention, along with the various 
features of novelty which characterize the invention, are 
pointed out with particularity in the claims annexed to and 
forming a part of this disclosure. For a better understanding 
of the invention, its operating advantages and the specific 
objects attained by its uses, reference should be had to the 
accompanying drawings and descriptive matter in which 
there is illustrated preferred embodiments of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present inven- 
tion and the advantages thereof, reference is now made to 
the following descriptions taken in conjunction with the 
accompanying drawings in which like ntmibers designate 
like parts, and in which: 

FIG. 1 is an iUustration of the dramatic increase sales in 
World Wide Web servers; 

FIG. 2 is an illustration of the estimated growth in web 
users; 

FIG. 3 is an illustration of an Intemet Services Delivery 
Platform program indicative of one environment where the 
present invention operates; 

FIG. 4 is a high-level block diagram of a computer system 
with which the present invention can be implemented; 

FIG. 5 is a diagram of a network in which the present 
invention can be implemented; 

FIG. 6 is a Sow chart of the frameworit of the decision 
analysis process of the present invention; 

FIG. 7 provides a functional view of the system's ele- 
ments in an Intemet environment; 

FIG. 8 provides an expanded view of the architecture of 
the present invention; 

FIG. 9 is an illustration of the various data sources and 
outputs fisr the present invention; 
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FIG. 10 is a (k tailed illustration of how the components 
interact with each other and the customer environment in the 
present invention; 

FIG. 11 provides an overview of the architectural com- 
ponents in various configurations of the present invention; 

FIG. 12 is a Uble providing additional hardware, 
software, database, and information tool options that may be 
used in conjunction with the present invention; 

FIG. 13 is a detailed illustration of the data discoverer 
architecture in accordance with the present invention; 

FIG. 14 is an iUustration of a functional architecture of the 
data discoverer in accordance with the present invention; 

FIGS. 1S-15A are illustrations of the architecture of the 
WebWarehouse designer of the present invention; 

FIG. 16 is an illustration of the WebWarehouse data 
sources for the services portfolio management, subscriber 
base management, and application management; 

FIG. 17 is a table key indicating relationships and table 
types; 

FIG. 18 is a table key indicating different types of tables; 

FIG. 19 is an exemplary data structure indicating sub- 
scriber service metrics data stored by the WebWarehouse 
designer and received from the data discoverer process; 

FIG. 20 is an exemplary table indicating subscriber ser- 
vice metrics data stored by the WebWarehouse designer and 
received from the data discoverer process; and 

FIG. 21 is an exemplary table indicating the various 
environmental and business data administered and/or col- 
lected by the data discoverer process and placed in the data 
warehouse. 

Notations and Nomenclature 

The detailed descriptions which follow may be presented 
in terms of program procedures executed on a computer or 
network of computers. These procedural descriptions and 
representations are the means used by those sldlled in the art 
to most effectively convey the sub^ance of their work to 
others sldlled in the art. 

A procedure is here, and generally, conceived to be a 
self-consistent sequence of steps leading to a desired result. 
These steps are those requiring physical manipulations of 
physical quantities. Usually, though not necessarily, these 
quantities take the form of electrical or magnetic signals 
capable of being stored, transferred, combined, compared 
and otherwise manipulated. It proves convenient at times, 
principally for reasons of common usage, to refer to these 
signals as bits, values, elements, symbols, characters, terms, 
numbers, or the like. It should be noted, however, that all of 
these and similar terms are to be associated with the appro- 
priate physical quantities and are merely convenient labels 
applied to these quantities. 

Further, the manipulations performed are ofiien referred to 
in terms, such as adding or comparing, whicb are commonly 
associated with mental operations performed by a human 
operator. No such capability of a human operator is 
necessary, or desirable in most cases» in any of the opera- 
tions described herein which form part of the present inven- 
tion; the operations are machine operations. Useful 
machines for performing the operation of the present inven- 
tion include general purpose digital computers or similar 
devices. 

The present invention also relates to apparatus for per- 
forming these operations. This apparatus may be specially 
constructed for the required purpose or it may comprise a 
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general purpose computer as selectively activated or recon- 
figured by a computer program stored in the computer. The 
procedures presented herein are not inherently related to a 
particular computer or other apparatus. Various general 

5 purpose machines may be used with programs written in 
accordance with the teachings herein, or it may prove more 
convenient to constmct more specialized apparatus to per- 
form the required method steps. The required structure for a 
variety of these machines will appear from the description 

10 given. 

BEST MODE FOR CARRYING OUT THE 
INVENTION 

The present invention provides an architecture and 

IS method of supporting, collecting, storing and analyzing 
Internet and/or electronic commerceAransactions over or 
from (hereinafter "over" or "from" are interchangeable) the 
Worid Wde Web for ISPs and/or CSPs. The present inven- 
tion also includes a computer system employing the method, 

20 a computer readable memory storing the medium, and a 
method for storing the synchronization process and in a 
storage device. The present invention provides tools that 
provide the necessary collection and analytical capabilities 
to derive knowledge from raw data collected from the web. 

25 In addition, the present invention provides scalability of 
processing and storage capacity, to meet a customer's grow- 
ing client needs. 

In the following description, for purposes of explanation, 
numerous specific details are set forth in order to provide a 

30 thorough understanding of the present invention. It will be 
apparent, however, that the present invention may be prac- 
ticed without these specific details. In other instances, well- 
known structures and devices are shown in block diagram 
form in order to avoid unnecessarily obscuring the present 

35 invention. 

The present invention combiries improved methods of 
data collection and storage with enhanced analysis tech- 
niques to exploit the vast amoimt of information available 
through emerging uses of the web, and other internet-based 
^ systems. The results can be used as part of a customer's 
decision support system to allow better (e.g., more statisti- 
cally valid) business decisions to be made faster. 

The present invention concentrates on opportunities 
within, for example, the ISP sector of the telecommunica- 
tions industry, and provides ISPs the insight into their 
customer's services usage, demographics, and behavior pat- 
terns. Through this insight, ISPs should be able to support 
their decisions in such areas as customer retention, service 
pricing and profitability, and target marketing campaign 
using analytical methods based on customer and market 
data. Furthermore, ISPs can provide the knowledge and 
capabilities to the hosted business customers in order for 
those customers to gain similar insight into their users to 
make effective and customer-driven business decisions. 

The present invention provides a complete data 
warehouse/decision support solution for Internet and/or 
electronic commerce-based applications and services. Its 
features include: 
^ A modular design that scales in size as well as function- 
ality. 

It runs on a series of platforms ranging frx)m single 
processors to massively parallel systems. 

A fiill range of design and implementation tools. 
65 Extraction and transformation programs tailored for the 
Internet and/or electronic coomierce operating environ- 
ments. 
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A choice of relational database management systems that 
includes MS SQL Server, Oracle, and Teradala. 

A comprehensive selection of decision-support capabili- 
ties. 

The present invention's modular design, hardware plat- 5 
forms and tools enable customers to select a decision- 
support ^stem that is consistent with the financial maturity 
of their business. As the business grows and decision- 
support needs become more complex, the platforms 
described herein can be upgraded in size as well as func- 
tionality. 

Hie present invention provides decision-support capabili- 
ties to help ISP*s/CSP*s manage their business from several 
perspectives: 

Providing high quality Internet Service to subscribers as 
a foundation for higher value, more profitable services. 

Managing the customer base to optimize profits and 
value. 

Providing on-line content and services as a value-added ^ 
incentive to attract and retain communities of 
subscribers, provide sources of advertising revenue, 
and potentially, to attract commerce clients. 
Providing decision-support information, as a potential 
fee-based service, to business clients to help them 25 
manage their Internet, web or electronic commerce 
applications that arc being hosted by the ISP/CSR 
Since a specific ISP/CSP may not be involved in every 
a^ect of the business described above, the present inven- 
tion's decision-support capability is modularized to allow a 30 
ISP/CSP to select the functionality that make sense for 
his/her business. 

The present invention's modular structure allows Internet 
and/or electronic commerce-based applications and services 
to be managed firom three perspectives: 35 

1. Business Operations Management to monitor and 
understand: 

Key business metrics (revenue, expenses, profitability, 

subscriber base). 
Factors that impact quality of service. 40 
The impact of marketing initiatives or competitive 

influences. 

2. Customer-Base Management to understand the relative 
value, behavioral characteristics, and preferences of 
individual subscribers or customers. 4S 

3. Electronic Commerce Management to measure the 
success of: 

Electronic-Commerce ^plications — applications pro- 
vided by the CSP's/ISP's or their business clients to 
conduct commerce over the internet. 50 
On-line content — applications provided by the CSP/ 
ISP to add value to the service offering. 
In addition, the present invention is complementary with 
the current Internet Services Delivery Platform program. 
One environment where the present invention operates is 55 
illustrated in FIG. 3. Within this environment, the consumer 
or business customer 4, 6, 8 is able to utilize the CSP's/ISP*s 
web-based resources through a combination of different 
access and backbone networks such as ISDN digital access, 
analog dial-up access, and/or dedicated access lines 60 
described below. These networks are, in turn, are connected 
to the Global Internet 2 via, for example, an internet service 
center 10 to interface with the rest of the internet resources. 

The access networks consist of many Point of Presences 
(POPs) 12, 14, 16 which provide dial-up (analog/digital, 65 
ISDN) access service or dedicated digital (Frame Relay, 
ISDN) access service using either modem pool technologies 
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and/or access server/router for the customers. The access 
networks also provide Dynamic Host Communication Pro- 
tocol (DHCP) services, a Proxy Cache Server, and a firewall 
to facilitate the service administration, address translation, 
security authentication and/or authorization, and web opera- 
tion efficiency. The backbone networks, consisting of 
switches and routers, optimize the transport between the 
access networks and the Internet Service Center 10. 

Internet Service Centers 10 may also provide Web Host- 
ing Service and Electronic Commerce Applications. One 
implementation of Internet Service Centers for ISPs/CSPs is 
a single site with multiple servers 18 (Web Server Cluster) 
interconnected through a campus network 20. In addition, 
Internet Service Centers optionally provide Authentication 
Service, Domain Name Service, Mail Service, News 
Service, Provisioning Service, and Network Management 
Service. 

The present invention provides value-added services to 
improve the operation and services of ISPs/CPSs. The 
present invention leverages data warehouse and decision 
support techniques augmented with networking elements 
and web technobgies. The goal of the present invention is 
to provide the customers with dynamic, flexible, and adapt- 
able interfaces and systems to fully utilize the interactivity 
of the new Web medium. As a result, decision makers will 
be able to access decision support information from a secure 
network in a simple (ease of use), timely (fast enough), 
usable (valuable) and personalized (customized) fashion. 
Furthermore, this information will allow the customers to 
make intelligence decisions and take timely actions to 
achieve their biisiness goals. 

The benefits of the present invention include: 

1. providing decision support and operational information 
for the ISP/CSP to enable new/improved value-added 
services, allow the implementation of equitable and 
value-based pricing, achieve better quality of service, 
manage capacity, and add bill-back capabilities for 
charge back scenarios. 

2. providing to the ISP's/CSP's, and their customers' 
valuable information such as decision support for 
proactive, targeted marketing, usage and customer pref- 
erences feedback on the web applications, usage infor- 
mation for business access, direction decision support 
access for ^ecific ISP/CSP customers* applications. 

FIG. 4 is a high-level block diagram of a computer system 
with which the present invention can be implemented. FIG. 
4 is a block diagram which illustrates a computer system 100 
upon which an embodiment of the invention may be imple- 
mented. Computer system 100 includes a b\is 102 or other 
communication mechanism for communicating information, 
and a processor 104 coupled with bus 102 for processing 
information. Computer system 100 also includes a main 
memory 106, such as a random access memory (RAM) or 
other dynamic storage device, coupled to bus 1(12 for storing 
informatioD and instructions to be executed by processor 
104. 

Main memory 106 also may be used for storing temporary 
variables or other intermediate information during execution 
of instructions to be executed by processor 104. Computer 
system 100 further inchides a read only memory (ROM) 108 
or other static storage device coupled to bus 102 for storing 
static information and instructions for processor 104. A 
storage device 110, such as a magnetic disk or optical disk, 
is provided and coupled to bus 102 for storing information 
and instructions. 

Computer system 100 may be coupled via bus 102 to a 
display 112, such as a cathode ray tube (CRT), for displaying 
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information to a computer user. An input device 114, includ- 
ing alphanumeric and other keys, is coupled to bus 102 for 
communicating information and command selections to 
processor 104. Another type of user input device is cursor 
control 116» such as a mouse, a trackball, or cursor direction 
keys for communicating direction information and com- 
mand selections to processor 104 and for controlling cursor 
movement on display 112. This input device typically has 
two degrees of freedom in two axes, a first axis (e.g., x) and 
a second axis (e.g., y), which allows the device to specify 
positions in a plane. 

The invention is related to the use of one or more 
computer systems 100 for supporting, collecting, and ana- 
lyzing Internet and/or electronic commerce over and/or from 
the World Wide Web for ISPs/CSPfe. According to one 
embodiment of the invention, computer system 100 per- 
forms the synchronization in response to processor 104 
executing sequences of instructions contained in main 
memory 106. Such instructions may be read into main 
memory 106 from another computer-readable mediimi, such 
as storage device 110. 

However, the computer-readable medium is not limited to 
devices such as storage device 110. For example, the 
computer-readable medium may include a floppy disk, a 
flexible disk, hard disk, magnetic tape, or any other magnetic 
medium, a CD-ROM, any other optical medium, punch 
cards, paper tape, any other physical medium with patterns 
of holes, a RAM, a PROM, an EPROM, a FLASH-EPROM, 
any other memory chip or cartridge, a carrier wave embod- 
ied in an electrical, electromagnetic, infrared, or optical 
signal, or any other medium from which a computer can 
read. 

Execution of the sequences of instructions contained in 
main memory 106causes processor 104 to perform the 
process steps previously described. In alternative 
embodiments, hard-wired circuitry may be used in place of 
or in combination with software instructions to implement 
the invention. Thus, embodiments of the invention are not 
limited to any ^ecific combination of hardware circuitry 
and software. 

Computer system 100 also includes a communication 
interface 118 coupled to bus 102. Communication interface 
108 provides a two-way data communication coupling to a 
network link 120 that is connected to a local network 122. 
For example, communication interface 118 may be an inte- 
grated services digital network (ISDN) card or a modem to 
provide a data commimication connection to a correspond- 
ing type of telephone line. As another example, communi- 
cation interface 118 may be a local area network (LAN) card 
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to provide a data communication connection to a compatible 50 sources. 



works and the signals on network link 120 and through 
communication interface 118, which carry the digital data to 
and from computer system 100, arc exemplary forms of 
carrier waves transporting the information. 

Computer system 100 can send messages and receive 
data, including program code, through the network(s), net- 
work link 120 and communication interface 118. In the 
Internet example, a server 140 might transmit a requested 
code for an application program through Internet 128, ISP 
126, local network 122 and communication interface 118. In 
accordance with the invention, one such application pro- 
vides for the method as described herein. Of course, the 
present invention may also be implemented on multiple 
computer systems. 

The received code may be executed by processor 104 as 
it is received, and/or stored in storage device 110, or other 
noD-voIatile storage for later execution. Id this manner, 
computer system 100 may obtain application code in the 
form of a carrier wave. 

FIG. 5 is a diagram of a network in which the present 
invention can be implemented. Referring to FIG. 5, depicted 
is a network 200 within which the present invention may be 
implemented. SmartEC server 230, according to one 
embodiment of the present invention, gathers information 
dynamically from one or more data sources, which may be 
located at different servers and may have incompatible 
formats, and outputs the information for the user according 
to predetermined criteria discussed below. 

The predetermined criteria may be defined by human 
operators according to their own needs, purposes, and pref- 
erences as part of the configuration of the server. 
Alternatively, the predetermined criteria may be determined 
based on system-defined user privileges. For example, users 
with a higher clearance or security and/or may be able to 
obtain more information than users with a lower clearance or 
security. Multiple information models and visual represen- 
tations may be defined for any server. 

A user may access the server by executing a web browser 
at client 210. Web browsers are well-known in the art, and 
are readily available from such corporations as Netscape 
Communications Corp. and Microsoft Corp. In order to 
access the server, the user at client browser 210 activates a 
hyperlink having a URL (Uniform Resource Locator). 

In the exemplary URL, the network address of the server 
is ^jecified as **www.server.com" and the optional portion of 
the URL after the question mark (?) hold user specified 
parameters. When the hyperlink is activated, the web server 
220 receives a request to initiate an session, specified by 
parameters embedded in the URL. In response, the web 
server 220 gathers information from one or more data 



LAN. Wireless links may also be implemented. In any such 
implementation, communication interface 118 sends and 
receives electrical, electromagnetic or optical signals which 
carry digital data streams representing various types of 
informatk>n. 

Network link 120 typically provides data commumcation 
through one or more networks to other data devices. For 
example, network Unk 120 may provide a connection 
through local network 122 to a host computer 124 or to data 
equipment operated by an ISP or CSP 126. ISP or CSF 126 
in turn provides data communication services through the 
world wide padcet data communication network now com- 
monly referred to as the "Internet" 128. Server 132, web 
server or proxy server, is used by ISP 126 for data collection 
purposes. Local network 122 and Internet 128 both use 
electrical, electromagnetic or optical signals which carry 
digital data streams. The signals through the various net- 
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The data or information sources can have incompatible 
formats, e.g. web page, relational database, spreadsheet, text 
file, etc. The data sources can be stored at a plurality of sites, 
for example, locally with reject to the web server 220, such 
as a hard disk at local storage 222, or externally at another 
site in the network, e.g. at mainframe 230. In fact, the data 
source can even be another, remote information web server 
240. 

The decision analysis process shown in FIG. 6 describes 
a framework of what is required to provide the business 
solution set. The architecture and process of the present 
invention includes the necessary data to solve specific 
business problems. As opposed to many traditional methods 
of collection and storage for data warehouse and mining, the 
present invention focuses on data collection and correlation 
of Web/internet information and data storage methods using 
Web technologies in distributed enviroimients. 
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As a result, data collection and data storage processes S2, all systems are interconnected, 

S4 are focused and continuous efforts, instead of general and systems (users, servers) are Web accessible, 

discrete approaches. The present invention also allows the allISP specify data (servers, database and warehouse) are 

proper archive of Uie ooUected mformation for sta istic and accessible on a secure network, 

analytic manipulaUon. The present mvention effecuvely and 5 ^^-w^^*"!^ « u'^* .... 

interactively presents the resulting business solutions. ^f, ^roach we Uke in the present mvenUon is to 

Specifically, three aspects of the processes are provided f^^^^"^ generic soluUon framework, but wit^ specific 

by the present invention business solution set: ^^US' ^t least initially, on the vertical mdustry (ISPs in the 

1. Find and coUect: this process 86 must determine what communication industry) and functionaUy on customer 
data elements are required. It is also necessary to know retention and profitability. The potential applicability for 
whether the data is available in open, closed, or registered ^° horizontal platform functionality requires further investiga- 
enviromnents. In order to execute the capture and collection tion. 

ofthedata,theability to set policy and operation parameters Each of the processes (find and collect, structure and 

must be provided to the user to allow options for data store, analysis and discovery) corresponds to a system 

searching and collection based upon their dynamic and component in this section. FIG. 7 provides a functional view 

changing business needs. Furthermore, this process should of the system's elements in an Internet environment. The 

also deal with vAicn and bow the data collection should be system's elements of the architecture of the present inven- 

executed. tion are: 

The whereabouts of the required data, sometimes, can be - ♦.^^i,- /\xT^urr^^»u\ ^on. 4\^a 

, , . ^ 1 ^ 1 on- * 1 ♦ 1* tracking module (WebTrack) 300: systems to nnd, 

known or unknown, internal or external. The ability to locate ^ , A^t.n^A xxr^u) 

the data and synthesize the multiple sources of data into 20 search, collect and capture detailed customer Web/ 

usefiil information is important as the technology evolves J°^™f ^ tUctxomc commerce usage and demo- 

and raaUircs. The foltowing daU collection options arc also graphics data. 

provided: 2. storage module (Web Warehouse) 302: Scaleable Data 

real-time: the time between locating the data and collect- Warehouse to structure, organize, storage and manage 

ing the data is negligible or about at the same time. 2s detailed customer Web/Internet and/or electronic com- 

These two activities are closely related and the actions merce data. 

are closely coordinated. 3^ intelligent decision support tool (WebSmart) 304: busi- 

on-line: the time between locating the data and collecting ness sohitions based on knowledge analysis and dis- 

the data is short because either (1) the data requires a covery of detailed customer Web/Internet and/or elec- 

certain duration to accumulate meaningful volume, or 39 tronic commerce data. 

(2) the importance of the data collection is not immi- addition, a general data warehouse 306 is also provided 

neiit. These two activities are loosely related and ^^^^^^ ^^^^ warehouse can also be referenced) for 

actions arc loosely coordinated. storage of non-web specific data, as well as possibly some 

off-line: the time between locaUng the data and oollcctmg ^^^^ preferences. Each of the 

the data IS long and these two actmUes arc separate and 3^ ^^^^ ^^^^^^^ responsively connected, direcdy or 

can be unrelati^ events, and actions need not be indirectiy.to web server 308, providing access to the internet 

coordinated at all. « » f & 

2. Structure and store: this process 88 involves parsing, . ^. • r • 1 
categorizing, cleansing, corrclating, indexing, and format- . addition the presentation of the mformation is also 
ting of the coUected data. Tbe processed data must be important to the de™ support process. It is a dehvery 
organized in a structured way (i.e., the data layout and ^ media to distribute the mformation and intelhgence to the 
segmentation). The intent of this process is to arrange the decision maker in a timely fashion. Therefore, all these 
data and/or information in a form where useful long term systems provide consistent, peisonalized, and easy to use 
and policy-related knowledge can be easily used and customer interfaces. These user interfaces should also have 
derived. the capabilities to operate in real-time, on-line, and/or off- 

3. Analysis and discovery: this process SIO recognizes the 45 line. 

patterns, trends, and exceptions of tbe data and/or the FIG. 8 provides an expanded view of the architecture of 

information based on statistic and analytic manipulation the present invention. In FIG. 8, the users define and 

techniques such as clustering, artificial intelligence, etc. The formulate the business questions and determine the queries 

goal is to provide the customers with recommended actions of the data coUected. Three major decision support user 

and/or the visual interpretation of the presented data for their 50 categories of the present invention arc: 

decision making S12. 1. ISP/CSP business managers: the present invention 

All these three processes are iterative in nature. The allows them to better understand their customers' needs and 

feedback process S14 among the processes^ and also from conduct effective mariceting and sales support to achieve 

the users and customers will establish the foundation of the their business objectives. 

continuing improvement. In addition, all three processes 55 2. ISP/CSP IT (Information Technology) Managers: the 

require user-friendly interfaces to facilitate information present invention enables them to obtain insight into the 

exchange. These user interfaces are important parts of the ISPVCSP's IT operations and provide ways to improve the 

present invention to allow solution navigation, iterative ISP 's/CSP's services. 

learning, and decision guidance. 3. ISP/CSP business clients: For the ISP's hosted business 

The following outlines the system elements and prelimi- 60 clients, the present invention enables ISP^CSPs to provide 

nary data required for the present invention. The present their business clients with detailed understanding about the 

invention intends to complement the Internet service deliv- clients' business needs. 

ery platform and provide the differentiation for our complete These users should be able to use the present invention 

suite of Internet solutions for Internet Service Providers through any Web client or networked client with properly 

(ISPs) and/or Commerce Service Providers (CSPs) in the 65 defined access privileges and procedures. The interface must 

communication industry. The assumptions of the oonfigu- aUow dynamic interaction between the users and the systenss 

ration are: to accommodate the new queries and questions resulting 
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from the changiDg business needs. This user input is bandied 
according to the current Web and user interface technologies 
sucb as HTML forms and Conimon Gateway Interface 
(CGI) or a window GUI interface interacting with the other 
elements (WebSmart, WebWaiehouse, SmartECAdmin) of 
the WebMining systems. 

WcbSmart 304 provides the mathematical Engine to per- 
form the statistic and analytical manipulation of the data and 
obtain the knowledge and intelligence for business decisions 
by the SmartEC client 314. 

WebWarehouse 302 is separated from the operational 
client-accessible systems such as Web servers and Web- 
Based databases, but is populated by daU from these sys- 
tems. It exists to make the data available for interrogation by 
the business users firom the perspectives of an enterprise 
interest and associated business benefits. Tbe data in the 
WebWarehouse is preferably time-stamped and associated 
with a defined period of time, and is usually subject-<^ented 
such as customer, product, activity. 

The change of this data in the WebWarehouse occurs as a 
result of decisioD support requirements, and update from one 
of the data sources 316. Therefore, the SmartecAdmin 312 
must have the capabilities to determine and control the 
search request and information flow between the operational 
databases and the WebWarehouse. SmartECAdmin must 
also have the ability to initiate the structural change, 
addition, and deletion of data in the WebWarehouse in an 
appropriate time based on the decision support user's needs. 

WebMap 300 is the vehicle to search and transform the 
information fi"om the data sources 316, and deliver the 
information to the WebWarehouse for storage using the 
secure network. The data sources preferably includes the 
system log^ (access log^, agent logs, mail logs, communi- 
cation server logs, accounting logs and refercr logs) on the 
Web servers, proxy server or mail server, and the user 
profiles (ISP/CSP customers visitors^ and transient travelers 
(pass-throughs)). 

From the Web operational perspectives, three major sets 
of information for Web applications and Web access must be 
also considered to determine the collection of useful data: 
(1) who is visiting, (2) what they are looking at, and (3) what 
resources they are using. There are other data sources 
available related to DNS, DHCP, and the Network Manage- 
ment Systems (NMS) which facilitate Web operations. In 
addition, the data sources should also consider third party 
analysis applications (based on the Web data). 

The data required for the WebWarehouse depends heavily 
on the scope and objectives of the business problems to be 
solved. A preliminary and generic example of tbe WebWare- 
house intended for solving tbe problenos an ISP/CSP may 
face is described as follows: 

From the user perfective, there are, at least, two types of 
users: visitors and the ISPVCSFs customers. Hie ISP's 
customers are those who subscribe to the ISP services by 
paying fees to the ISP or purchase something from the 
ISP/CSP. The visitors are thcKse who access the services via 
other Web servers without payment to the particular ISP/ 
CSR The profile data related to the ISP/CSP customers 
should inchide, for example: 

company name 

company administrator name 

user name 

address including city, state, zip code 
phone number inchiding day, night and fax 
user login name and/or IP address 
email address 
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chosen Point of Presences 

credit card information inchiding card type, number, 

name on the card 
domain name 
billing contact 

user selection of ISP's services including dial-up service, 
business ISDN service, corporate email service, server 
hosting, corporate web services. 
10 user chosen service plans and associated pricing itifonna- 
tion. 

The visitor profile data, in addition to the domain name 
and IP addresses, depends on how much information the 
visited applicatiois can entice the visitors to provide and 
15 what the visitors are willing to share with the ISP/CSP. 

From the Web operational service perspective, three 
activities are correlated: user, application (including hosting 
and commerce services), and access. Web servers contain 
minimal information about access from the networking point 
20 of view. The capture of Web access activity requires inter- 
faces to otter network accessible systems such as modem 
pools and routers. 

The user activity provides, for example, the following 
data; 

2^ domain names and IP addresses of tbe systems/users 
making the requests 
login names and IP addresses of the client systems access- 
ing the site 

referer lists from where users are accessing the server 

most common visitor domains 

most common external references 

first time visitors 

retiuning visitors 

most frequent visitors 

visitors per visitor category 

visits per visitor category 

user demographics including geography, age, income, 
40 professional. 

where do people come from when accessing this page 

where users or subscribersAdsitors go from this page. 

The activity provides, for example, the following data: 

hits to the site 

new contents for the site 

number of files sent 

hot hits (most frequent accessed files) 

most popular pages 
so types of transactions 

date and time of transactions 

time spent on a particular page 

average time spent on a page 

how many leave before the first page arrives 

details of most popular objects 

details of most popular pages 

least popular objects 

least popular pages 
^ most popular entrance pages 

most popular content categories 

next click analysis 

visits per content category 
65 visits to content combinations 

In order to understand the efSciency of the ISP/CSP 
operation, it requires an tmdcrlying resource charactcriza- 
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lion to inter-relate the above activities. There are three 
aspects of this characterization: what the structure is 
(necessary Web site application/service structure, and the 
relationships — one-to-one, many-to-one, many-to-many), 
what actually happens (logs), and how well the systems are 
performing (performance). This characterization (please 
note that this remains to be defined) is important in setting 
up a useful WebWarehouse and provide adaptable services 
for the present invention. Sample data related to resource 
utilization includes, for example: 

protocol used 

file sizes transferred 

response time 

peak traffic periods 

update interval 

most common browser types 

retrieval mettKxl 

error activities 

accounting information including billing records 

services used (for example, support). 

The present invention can be standalone or co-exist with 
the SmartEC decisions server, data bases, and data ware- 
house. The present invention accommodates both NT and 
UNIX with appropriate systems tools and capacity to enable 
access to other web servers and internet-based systems or 
networked such as NMS and special application servers. The 
present invention operates, for example, in a pure ISP 
environment initially. The capabilities of the present inven- 
tion are also extendable to the Mixed ISP and SmartEC (or 
ISP decision support) systems provider environment (such 
that SmartEC systems providers can offer professional 
services), the Intranet environment for business customers, 
and provide business customers using ISP/CSP hosting and 
Internet and/or electronic commerce services a logical per- 
spective of how they are operating. 

As depicted in FIG. 9, the data to populate WebWarehouse 
302 originates from many sources. The value of the present 
invention lies in its ability to integrate business operational 
data with web-specific and internet and/or electronic com- 
merce data to provide decision support information about 
Intemet/Web applications in a business, rather than a tech- 
nical context 

WebMap or WebTrack 300 consists of the programs and 
database structures designed to: 

Collect data from the appropriate sources. 

Interpret, analyze, translate, and refine the source data in 
order to extract the required data sets to implement decision- 
support fiinction& 

Correlate disparate data sets prior to populating Web- 
Warehouse 302. 

WebWarehouse 302 furnishes the information necessary 
to manage an Internet Service or an electronic commerce 
business from the perspectives outlined above. Different 
WebWarehouse views can be extracted to provide an infor- 
mation framework to drive specific decision-support efforts. 
WebWarehouse also includes tools that provide the appro- 
priate logical design, physical design and documentation. 
Design templates are available to identify the required data 
elements from the customer's operational databases. There 
are also processes and tools to integrate customer-specific 
elements into WebWarehouse* s core design. This architec- 
ture described herein may also be implemented as a data- 
marl. A datamart is structured to support specific decision 
support needs, typically at a departmental level. 

WebSmart 304 provides decision-support modules with 
feature sets that vary in the sophistication of the business 
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information being provided. In this context, there are several 
levels of decision-support provided by WebSmart. 
functionality that provides a basic level of decision- 
support functionality 
^ functionality that can be addressed by standard OLAP 
decision-support software. At this level, WebSmart 
provides a set of pre-formatted report templates* filters 
and metrics for each decision-support feature set. There 
are tool sets for each decision support software package 
supported by WebMining or SmartEC decisions, 
functionality used to detect previously unknown, but 
potentially useful relationships between different types 
of data. Here, WebSmart provides a set of templates 
oriented to help solve specific business problems such 
as customer retention, market basket analysis, etc. 
Additionally, Metadata is provided for each of the sup- 
ported On-Line Analytical Processing (OLAP or Knowledge 
Discovery (KD) tools, as appropriate. Metadata defines the 
data views necessary to produce the outputs required for 
each decision support feature set Each metadata package 
also includes templates to integrate customer-specific data 
into the standard design. 

FIG. 10 is a detailed illustration of how the components 
interact with each other and the customer envirorunent. The 
^ present invention works under three envirormiental factors. 
Customer engagement 320 provides key information on 
customer needs. These needs are influenced by how well 
customer operational systems 322 are running and how 
customer business actions 324 are executed. The customer 
engagement process has a major influence on the design. 
The components must interact with customer operational 
systems to determine the proper data model for WebWare- 
house 302 and how the information can be coUected and 
extracted for WebMap or WebTrack 300. FinaUy, the cus- 
tomer business actions depend heavily on what knowledge 
customers can get from the products and solutions. 

WebSmart 304 is intended to provide the best possible 
knowledge for customers to conduct their business given a 
specific business problem. There are six key components in 
^ the present architecture: 

WebMining or SmartEC decisions Front-end 326 (Data 
Discoverer 328 and Offer/System Configurator 334 and 
Migration Plarmer 336). 
^5 WebWarehouse Designer 330 
WebMap or WebTrack 300 
WebWarehouse 302 
WebSmart 304 

WebMining or SmartEC decisions Metadata Management 
50 Utilities 332. 

The WebMining Front-end facilitates the interaction 
between WebMining solutions and customers. Data Discov- 
erer automates the discovery process to enable service 
practitioners to systematically find the necessary informa- 
55 tion for defining customer problems via customer engage- 
ment. The output of the Data Discoverer will impact the 
WebWarehouse design and determine the effectiveness of 
the collectk)n and extraction utilities of WebTrack. 
The offer/systems configurator and migration pUnner 
60 function (WebMining Platform Selector) will help both 
customers and WebMining sales and service personnel to 
determine, depending on customer needs, what hardware, 
software and associated applications are required and how 
customers can move from one set of platforms to another 
65 based on the changing customer requirements. 

WebWarehouse Designer contains two main components: 
logical data model 338 and formatted h'brary 340. The 
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logical data model provides overall descriptions of what the 
WebWarehouse is. This description will determine how well 
the WebWarehouse can integrate additional datascts and 
how effective the access of WebWarehouse will be for 
queries and reports. Formatted File Library is a by-product 5 
of conducting standard Collection/Extraction Utilities 342 
and Tracking/Mapping Utilities 344. The Formatted File 
Library provides the intermediate classifications such as 
process dharacterizations, customer descriptions, preference 
determination, and behavior patterns. These classifications 10 
can be reused in different technical process and different 
customer problems. 

WebTrack collects and extracts information from cus- 
tomer operational systems. The collected and extracted 
information will be further manipulated to determine the 15 
identity and characterization of the users via Tracking or 
Mapping Utilities. Data Transformation Tools 346 load the 
data into the WebWarehouse based on the principles of the 
WebWarehouse Designer and the information discovered 
during the execution of Collection/Extraction Utilities and 20 
Tracking and Mapping Utilities. 

WebWarehouse is the physical repository of SmartEC or 
WebMining information. 

WebSmart provides the intelligence of the present inven- 
tion. Decision Support Systems (DSS) took 348 provide the 25 
access, query, and report capabilities using WebWarehouse. 
Mining tools 350 enable the present invention to discover 
hidden knowledge from existing data and information. 
Furthermore, Mining tools could establish systems model 
such that Prediction/Forecasting 352 capabilities can be 30 
realized. All this intelligence can be formulated such that the 
business impact can be clearly crystallized via Business 
Impact Report 354. 

Since there are multiple systems and databases involved 
in providing WebMining solutions, it is important to prop- 35 
erly manage, coordinate, and reconcile the different sets of 
metadata. Metadata Management Utilities serve as a control 
center of these different metadata for the present invention. 

The above architecture is mainly examined from input 
and output process perspectives among physical compo- 4Q 
nents Tlie physical platforms required for the architecture 
described herein are described in FIG. 11. The present 
invention is capable of running on, for example, an NT or 
UNIX server and can be accessed by users through a 
browser or OSS/Data Mining Tools GUI interface via a 45 
network connection. 

The system environment in FIG. 11 provides an overview 
of the architectural components in various configurations. 
DSS/Data Mining can run locally on the database platform 
or remotely on an NT or UNIX server attached to the 50 
database server via a network connection. FIG. 12 is a table 
provkling additional hardware, software, database, and 
information tool options that may be used in conjunction 
with the present invention. 

FIG. 13 is a detailed illustration of the Data Discoverer 55 
architecnire in accordance with the present invention. The 
Data Discoverer 360 is a firont-end tool used to automate as 
much as possible the discovery process on customer infor- 
mation and business rales. The Data Discoverer determines 
or protects the structure of the data stored in the WebWare- 50 
house. Data Discoverer provides the following functions: 

deletes, adds, updates information stored in the WebWare- 
house 

reports information to the customer to allow customer to 
review the information and provide feedback (e.g., 65 
which information customer wants to, or more inter- 
ested in receiving, customized reporting) 
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provides a user interface for setting up new customers to 
collect information to begin webmining operation Web- 
Warehouse 302 is populated with data extracted from 
two primary sources: the CSP's/ISFs business opera- 
tional databases 364, 366 and various Internet or Web- 
based system files 368. The Internet/web data is rela- 
tively consistent from implementation to 
implementation. Minor variances will exist depending 
on whether the customer is running on UNIX or NT, the 
logging options selected by the CSP/ISP and, in some 
cases, by the access hardware. These variances will 
impact the utilities that load the warehouse, but not its 
overall structure. The business operational data, on the 
other hand, may be unique for each CSP/ISP Web- 
Warehouse design templates take this into consider- 
ation by providing a fixed design for the consistent data 
and indicating, specifically, the type of data that must 
be extracted from the CSP's/ISP's operational systems 
to provide the functionality provided by WebSmart. 
User interfece 372 interfaces or interacts with process 
controller 374 to store the data in reference database 
376. User interface receives the data from the external 
sources via a data entry module for input into the 
various areas of the reference database, inchiding 
schema, decision support taxonomy, data collection 
controller, or reference library. 
Each of these various database areas are organized to 
maintain the discovered data in accordance with a predeter- 
mined and expandable format Accordingly, at the end of this 
phase, the structure and location of all source data required 
for the warehouse is defined via the reference database. 
During the data discovery phase, specific decision-support 
objectives are also established and calibrated against the 
mission statement of the business. Hie results of the data 
discovery phase are then used to determine what extensions 
are needed to the WebWarehouse and WebSmart core 
designs. 

FIG. 14 is an illustration of a functional architecture of the 
data discoverer in accordance with the present invention. 
The goal of the data discoverer 360 is to collect operational 
and business data, and then store the collected data in a well 
defined structure. The completed structure, also known as 
metadata, will provide the necessary input to WebWare- 
house Designer, described above. 

In FIG. 14, the service practitioner 380 starts the data 
discovery with an interview questionnaire 382. The ques- 
tionnaire provides a stmcture to collect: 

the data 

a description of the data 

the environmental information about the data (e.g., how 
long files are kept on server without deleting, how 
current is dau refreshed, how information is updated, 
how data is manipulated over time period, what POP or 
physical bcation) 
volatility of the data. 
The data received via the questionnaire is then parsed, 
manually or automatically, into environmental 
characterization/related data 386 described above, and busi- 
ness related data 384. Examples of business related data 
include, e.g., pre-paid user, how user purchases services and 
products, discounts, billing rates, free subscriptions, free 
areas/information on web page. 

Information source requirements 388 are determined 
from, or considers, at least one of the environmental char- 
acterization 386 or business characterization 384 when 
determining or populating the appropriate core business 
rules 392 and the appropriate core data sources 394 that 
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fulfill the information source requirements. Information 
source requirements comprise , for example, identifies 
where, physical/logical location, the information^ business/ 
environment is located in a database. 

G)re business rules 392 comprise a minimum or prede- 
termined set of rules used to administer the subscriber 
account, e.g., predefined rules for the customer to perform 
operations, discounts, rebates, use of subscriber information 
involving logon id per user, account id group of users, 
subscriber id, internal database id, and the like. Core data 
sources 394 comprise the minimum data set which each 
customer is required to supply to the user data, e.g., user 
logon id, valid account, and the like. 

Decision support requirements 390 utilize the business 
characterization when parsing into incremental data sources 
396 and incremental business rules 398. The decision sup- 
port requirements customized support requirements, infor- 
mation needed by customer beyond the core set of report 
functionality, e.g., county of residence which is not normally 
collected. Core data sources 394 and incremental data 
sources 396 are then parsed into or pointed to the appropriate 
intemetAveb source related information 400 and operational 
sources 402. 

Interaet/web source related information 400 and opera- 
tional sources 402 are subsequently stored in warehouse 
designer 330, along with one or more core business rules 
392 and incremental business rules 398. In this phase, the 
data discoverer will obtain the core data set 394. The core 
data set includes, but is not limited to, files, schema, tables, 
report, data elements and relationships. 

The results of the data discovery process is highly itera- 
tive and highly dependent on the customers' needs. The 
purpose of data discovery is to achieve a starting point for 
a systematic way of discovering customer information. 
Interview templates are provided to guide the user through 
the process of identifying and recording information perti- 
nent to WebWarehouse that describes the customers' infor- 
mation systems. 

Business Operational Systems 

Database or file names 

Location of database or file 

Business rational for each database or file 

Database table descriptions and relationships 

Data element descriptions for database tables or files 

Business rules that dictate database or file data integrity 

Update policies and frequency 

Data volumes 

Data volatility 

Internet system logs. 

The data discoverer also collects the following type of 
information: 

Locations of data collection points 
Data volumes 

Operating system type and version 
File formats 

. . . about the following server logs: 
Radius log — records information about access server ses- 
sions 

Access log — records information about requests to local 

web applications 
Proxy log — records information about requests to remote 

web applications 
Referrer log — records information about hyperlinks 

between applications. In context of an application, log 

indicates who is linking to it 
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Mail log 

The Data Discoverer is a self-contained >^^ndows-based 
application that is capable of being nm on a laptop computer. 
Data discovery occurs on the customer's premises. Data 
5 collection templates and processes arc provided to inventory 
or store: 

The types of services provided by the CSP/ISP 

Billing codes 
Service codes 

Subscriber identification codes 
On-line content 

Electronic oonmierce applications. 
With these types of applications, each one has a mutually 
15 exclusive business operations environment, customer base 

and business rules. 
Data collection templates and processes are provi^d to 

collect data pertinent to the customer's internet service 

environment. Included are: 

Location and attributes of POPs (Point of Presence) 
Topology and capacity of access facilities 
Topology and capacity of Internet facilities 
Types, names and locations of servers. 
25 Data collection templates and processes are provided for 

the function of: 

Validating the appropriateness of WebMining provided 

decision-support functionality for the customer's specific 

business needs. 

30 Identifying and describing any additional business needs 
and/or decision support functionality that will require 
extending WebMining's core design and decision support 
templates. 

Information collected about the customer's operational 

35 environment, information systems, and business needs is 
formatted into readable data files that can be integrated into 
WebWarehouse metadata. 

FIGS. 15-15A are illustrations of the architecture of the 
WebWarehouse designer of the present invention. WebWare- 

40 house designer 330 provides a core design that is tailored for 
the decision support functions provided by WebSmart 304 or 
alternatives. Extensions and/or changes to this design, 
however, are required to accommodate the CSP's/lSP's 
operational data and, peiiiaps, any additional decision- 

45 support requirements and/or changjes that surfaced during 
the data discovery phase. WebWarehouse designer 330 con- 
tains two main components: logical data model and format- 
ted library. The logical data model provides overall descrip- 
tions of what the WebWarehouse is. This description 

50 determines how well the WiebWarebouse can integrate addi- 
tional dataset, how effective the access of WebWarehouse 
will be for query and report. Formatted File Library is a 
by-product of conducting collection/extraction utilities and 
tracking utilities. The Formatted File Library provides the 

55 intermediate classifications such as process 
characterizations^ customer descriptions, preference 
determination, and behavior patterns. These classifications 
can be reused in different technical process and difiEerent 
customer problems. 

60 Tables, relationships, keys and data elements are inte- 
grated in the WebWarehouse metadata database 406. Also 
integrated are the locations and descriptions of the source 
data described during data discovery, the business rules, the 
database integrity conventions and the decision support 

65 views. The result is metadata that describes, for example, the 
specific implementation of WebWarehouse that is adminis- 
tered by the WebWarehouse designer. Metadata is data about 
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data. Metadaa keeps track of aod defines what is in the data 
warehouse, as well as the sources of data feeding the 
warehouse. 

WebWarehouse Designer is optionally a Graphic User 
Interface (GUI) based design tool that provides the appro- 
priate logical, physical design and documentation for each 
data warehouse or datamart. Also provided are the design 
templates required to integrate the data elements from the 
CSP's/ISP's operational databases into the appropriate data 
warehouse or datamart. These tools are intended to be used 
by either services practitioners or the customer's informa- 
tion systems staff. The metadata is implemented, for 
example, in a standard RDBMS format that is compatible 
with the RDBMS used to implement WebWarehouse. Other 
data formats may also be used so long as a similar format is 
used or compatible between the WebWarehouse Designer 
and the WebWarehouse. 

WebWarehouse Designer is capable of reading a dataset 
and integrating the data into the WebWarehouse metadata. 
The location and format of customer-provided data is 
recorded during the data discovery process. The Data Dis- 
coverer 360 tool produces an output dataset filtered via the 
delta dataset module 414 used to only transmit data that has 
been altered to the WebWarehouse Designer. That describes 
this information in a predetermined but expandable format. 

In FIG. 15A, the Data Discoverer tool 360 is a user tool. 
The user can be customers or professional service practitio- 
ners. WebWarehouse Designer 330 is a database adminis- 
trator tool. 

In this case, the WebWarehouse Designer 330 contains a 
predetermined and/or customers' design extensions. The 
specific core design of metadata drive the contents of the 
Data Discoverer 360. In conjunction with the user 
(customer) interaction. Data Discoverer 360 produces the 
updated business requirements on Metadata. At this point, a 
manual and/or automated procedure can be chosen to allow 
the design extension of the core design or design changes to 
obtain a customized core design. 

In FIG. 15, the Data Discoverer is integrated into the cOTe 
design by the integration function 412. Administration/ 
Management in user interface 404 provides for Database 
Administrator (DBA) conUx)l and sutus, as required, during 
the integration process. Provisions are provided to allow a 
database administrator or decision support specialist to 
interact with the system. Change Control function in the user 
interface 404 allows elements to be added to the metadata 
406 manually by a database designer. This allows changes 
and enhancemrats to be made. Query Control in the user 
interface 404 provisions for queries and reports when the 
metadata is established. This will provide the WebWare- 
house documentation and will provide a reference for the 
dedsion-support analyst when designing new queries, 
reports or models. 

The translator 408 provides a standard function for access 
to metadau 406 by either WebSmart 304 or WebTrack 300. 
WebTrack 300 requires definitions of customer-provided 
and IntemetAVeb elements in order to manipulate datasets 
during the WebWarehouse load process. Query/report gen- 
erator 410 interfaces with user interface to output reports in 
paper or electronic form. The metadata is extensible. The 
metadata contains a description of each warehouse element 
that is included in the WebWarehouse core design. The core 
design provides the information stmcture that supports Web- 
Smart's pre-defined decision-support functionality, i.e., 
information source requirements described above. The core 
design provided with the system can be extended to imple- 
ment any additional and/or dianged functions desired by the 
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customer, i.e., decision support requirements described 
above in connection with the Data Discoverer. Also, data 
elements required &om the customer's files or databases 
must also be identified and included. 
5 Metadata will provide a comprehensive description of 
each WebWarehouse component and the sources of data 
used to populate Webwarehouse. The metadata comprises 
the following information: 
Schema for Intemet/Web data sources 
to Type of system logs 
Description of each log 
Location of each log 
Data format for each log entry 
15 Operational considerations (e.g., firequency of data 
collection). 
Schema for operational data sources 
Required data sources 

Schema describes the table, files and/or data elements that 
must be added to WebWarehouse to enable the 
decision-support functionality provided by, for 
example, WebSmart 

Table/file location 
2^ Table/file re^nsible party 

Table/file descr^tion 

Table relationships (if database) 

Data element descriptions 

Customer extensions (describes the data sources neces- 
^0 sary to implement customer desired extensions to Web- 
Warehouse functionality). 
Table descriptions including Table type. Table name. 
Table description, Primary Key, Alternate key. Foreign 
33 key(s). 

Data Element Descriptions including Field Name, Label, 
Description, Parent Table, Shared by Alias, Data type. 
Length, Decimal places. Input mask, Display format. 
Legal characters, Key type, Uniqueness, Required 
^ value. Null support, Edit rules. Comparison 
boundaries. Operations allowed. Values entered by. 
Default value. Range of values 
Table Relationship Descriptions 
Business Rules 
45 Decision Support Views. 

The core design of WebWarehouse includes data 
structures, tables and the core data elements required for 
each table. The material is oigacdzed by data warehouse or 
datamart. FIG. 16 is an iUustration of the WebWarehouse 
50 data sources for the services portfolio management, sub- 
scriber base management, and q)plication management. 
FIG. 19 is an exemplary data structure indicating subscriber 
service metrics data stored by the WebWarehouse designer 
and received from the data discoverer process. The data 
55 stmcture section illustrates the tables used in a given data 
warehouse or datamart, the connections between tables that 
arc rclated to one another and the types of relationships 
between them. Also ^own are the primary and foreign keys. 
A primary key uniquely identifies the record in wtiich it 
60 exists. A foreign key in one table is an attribute whose values 
are the values of a primary key in another table. The foreign 
key establishes a link between the two tables. 

FIG. 20 is an exemplary table indicating subscriber ser- 
vice metrics data stored by the WebWarehouse designer and 
65 received from the data discoverer process. The table section 
describes why each table is important to the functionality of 
the data warehouse or datamart This core design reflects the 
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sel of business requirements that is expected to be substan- 
tially common from customer to customer. This core design 
is extensible to accommodate differences in customers* 
operational environments and business needs. Because of 
this and the broad scope of any data warehouse data mining 
effort, it is expected that the designs shown in this section 
will be modified and refined throughout the customer trial 
process. FIG. 21 is an exemplary table indicating the various 
environmental and business data administered and/or col- 
lected by the Data Discoverer process. 

Accordingly, as described above, the present invention 
combines improved methods of data capture with enhanced 
analysis techniques to exploit the vast amount of informa- 
tion available through emerging uses of the web, and other 
internet-based systems. The results can be used as part of a 
customer's decision support system to allow better (e.g., 
more statistically valid) business decisions to be made faster. 

The many features and advantages of the invention are 
apparent from the detailed specification, and thus, it is 
intended by the appended claims to cover aU such features 
and advantages of the invention which fall within the true 
spirit and scope of the invention. Further, since numerous 
modifications and variations will readUy occur to those 
skilled in the art, it is not desired to limit the invention to the 
exact construction and operation illustrated and described, 
and accordingly, all suitable modifications and equivalents 
may be resorted to, falling within the scope of the invention. 

What is claimed is: 

1. In a computer system for supporting and analyzing 
Internet and electronic commerce data over the World Wide 
Web for service providers, the computer system including at 
least one user station transmitting a user request, at least one 
server, connectable to said at least one user station, said at 
least one server providing said at least one user station 
access to external data sources being in a plurality of 
different data formats, and at least one data collection and 
analysis system, a system of integrating data elements from 
an operational database of the service provider including at 
least one of an Internet service provider (ISP) and a com- 
merce service provider (CSP) into a predetermined formal 
for supporting collection and integration of the Internet and 
electronic commerce data, comprising: 

a database storing the integrated electronic commerce and 
Internet data for interrogation by the service provider 
wherein the electronic commerce data is merchant 
transaction data recording a specific purchasing trans- 
action and the Internet data is data collected during web 
browsing by a user; and 
a user station, re^nsively connectable to said database 
and including a design database storing: 
a logical data model providing a description of the 
database facilitating the integration of the plurality of 
different formats of Internet and electronic com- 
merce data and facilitating query and report access of 
the database; 

a formatted file library providing classifications includ- 
ing at least one of process characterizations, cus- 
tomer descriptions, preference determinations, and 
behavior patterns, the classifications being reusable 
for at least one of different technical processes and 
different customer problems; and 

a user interface capable of providing the query access 
and report access of the design database. 

2. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
format for supporting collection of the Internet and elec- 
tronic commerce data according to claim 1, wherein the 
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electronic commerce data includes environmental data, and 
the environmental data includes at least one of how long files 
are kept on a server without deleting, how current is the data, 
how often is the data refreshed, how the data is updated, how 

5 the data is manipulated over time, the physical location, and 
the point of presence. 

3. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and extendable format for supporting collection of the 
Internet and electronic commerce data according to claim 1, 
wherein the Internet and electronic commerce data includes 
business data, and the business data includes at least one of 
pre-paid user services, how the user purchases services and 
products, user discounts, billing rates, free subscriptions, 
and free information. 

15 4. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and extendable format for supporting collection of the 
Internet and electronic commerce data according to claim 1, 
wherein the design database stores information source 

20 requirements, and the information source requirements 
include at least one of physical location and logical location 
where the information is located. 

5. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 

25 format for supporting collection of the Internet and elec- 
tronic commerce data according to claim 1, wherein the 
design database stores core business rules, and the core 
business rules include a minimum predetermined set of rules 
used to conduct customer business operation including 

30 administering a customer account. 

6. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
formal for supporting collection of the Internet and elec- 
tronic commerce data according to claim 1, wherein the 

35 design database stores core data sources, and the core data 
sources include a minimum predetermined data set which 
the customer is required to supply to perform the subsequent 
retrieval of the information. 

7. A system of integrating data elements from an opera- 
40 tional database of a service provider into a predetermined 

and expandable format for supporting collection of the 
Internet and electronic conmierce data according to claim 1, 
wherein the Internet and electronic commerce data informa- 
tion is retrieved from a distributed on-line, and interactive 
45 enviroimient including multiple client browsers. 

8. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 7, 

50 wherein the multiple clients or client browsers are capable of 
accessing a SmartEC decisions server storing information. 

9. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 

55 Internet and electronic commerce data according to daim 8, 
wherein the multiple client browsers arc capable of access- 
ing a server storing information including customer support 
material and sales campaign schedules. 

10. A system of integrating data elements from an opera- 
60 tional database of a service provider into a predetermined 

but flexible format for supporting collection of the Internet 
and electronic commerce data according to claim 1, wherein 
the logical data model and the formatted file library com- 
prise metadata, and the metadata is used to WebTrack and 
65 define the Internet and electronic commerce data stored in 
said database and sources of the Internet and electronic 
commerce data feeding said database. 
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11. A system of integraliog data elements from an opera- 
tional database of a service provider into a predetermined 
but flexible format for supporting collection of the Internet 
and electronic commerce data according to claim 1, wherein 
said system is capable of reading a dataset and integrating s 
the dataset into said database as metadata. 

12. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
format for supporting collection of the Internet and elec- 
tronic commerce data according to claim 1, further com- lo 
prising a translator prcfviding an access function to access 
the logical data model and the formatted file library stored 

in said design database. 

13. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined is 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 1, 
wherein the logical data model and the formatted file library 
comprise metadata, and the metadata includes a description 

of elements stored in said database in accordance with a core ao 
design. 

14. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 25 
13, wherein the core design provides an information struc- 
ture that supports pre-defined decision support functionality. 

15. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 30 
Internet and electronic commerce data according to claim 1, 
wherein the logical data model and the formatted file library 
comprise metadata, and the metadata provides a compre- 
hensive description of each component and source of the 
Internet and electronic commerce data used to populate said 35 
database. 

16. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 1, 40 
wherein the logical data model and the formatted file library 
comprise metadata, and the metadata comprises schema for 
IntemetMeb data sources, and schema for operational data 
sources. 

17. A system of integrating data elements from an opera- 45 
tional database of a sendee provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 
16, wherein the schema for Interaet/web data sources 
includes type of system logs, description of each log, 50 
location of each bg, data format for each log entry, and 
operational consideratiots. 

18. A system of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 55 
electronic commerce data according to daim 16, wherein the 
schema for operational data sources includes tables, files 
and/or data elements that must be added to said database to 
enable decision-support functioQality. 

19. A method of integrating data elements from an opera- 60 
tional database of a service provider into a predetermined 
and expandable format for supporting collection and inte- 
gration of the Internet and electronic commerce data being 

in a plurality of different data formats using a computer, 
comprising the steps of: 65 
(a) storing the integrated electronic commerce and Inter- 
net data for interrogation by the service provider in a 
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database wherein the electronic commerce data is mer- 
chant transaction data recording a specific purchasing 
transaction and the Internet data is data collected during 
web browsing by a user; 

(b) storing in a design database a logical data model 
providing a description of the database facilitating the 
integration of the plurality of different data formats of 
the Internet and electronic commerce data and facili- 
tating query and report access of at least one of the 
databases and the design database; 

(c) storing a formatted file library providing classifica- 
tions including at least one of process characterizations, 
customer descriptions^ preference determinations, and 
behavior patterns, the classifications being reusable for 
at least one of different technical processes and differ- 
ent customer problems; and 

(d) providing the query access and report access to the 
design database. 

20. A method of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 
19, wherein the logical data model and the formatted file 
library comprise metadata, and the metadata is used to track 
and define the Internet and electronic commerce data stored 
in said database and sources of the Internet and electronic 
commerce data feeding said database. 

21. Amisthod of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 
19, wherein said method is capable of reading a dataset and 
integrating the dataset into said database as metadata. 

22. A method of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 
19, further comprising a translator providing an access 
function to access the logical data model and the formatted 
file library stored in said design database. 

23. A method of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable formal for supporting collection of the 
Internet and electronic commerce data according to claim 
19, wherein the logical data model and the formatted file 
library comprise metadata, and the metadata includes a 
description of elements stored in said database in accordance 
with a core design. 

24. A method of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 
23, wherein the core design provides an information struc- 
ture that supports pre-defined decision-support functionality. 

25. A method of integrating data elements from an opera- 
tional database of a sendee provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 
19, wherein the logical data model and the formatted file 
library comprise metadata, and the metadata provides a 
comprehensive description of each component and source of 
the Internet and electronic commerce data used to populate 
said database. 

26. A method of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 
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19, wherein the logical data model and the formatted file 
library comprise metadata, and the metadata comprises 
schema for Intemet/web data sources, and schema for opera- 
tional data sources. 

27. A method of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 
26, wherein the schema for Intemet/web data sources 
includes type of system logs» description of each log, 
location of each log, data format for each log entry, and 
operational considerations. 

28. A method of integrating data elements from an opera- 
tional database of a service provider into a predetermined 
and expandable format for supporting collection of the 
Internet and electronic commerce data according to claim 
26, wherein the schema for operational data sources includes 
tables, files and/or data elements that must be added to said 
database to enable decision-support functionality. 

29. A computer program memory, storing computer 
insUiictions to integrate data elements from an operational 
database of a service provicter into a predetermined and 
expandable format for supporting collection and integration 
of the Internet and electronic commerce data over one or 
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more networks, the integrated electronic commerce and 
Internet data being in a plurality of different data formats, the 
computer instructions inchiding: 

(a) storing in a design database a logical data model 
providing a description of the database facilitating the 
integration of the plurality of different data formats of 
Internet and electronic commerce data and facilitating 
at least one of query and report access to the design 
database, the electronic conmieroe data is merchant 
transaction data recording a specific purchasing trans- 
action and the Internet data is data collected during web 
browsing by a user; 

(b) storing a formatted file library providing classifica- 
tions including at least one of process characterizations, 
customer descriptions, preference determinations, and 
behavior patterns, the classifications being reusable for 
at least one of different technical processes and differ- 
ent customer problems; and 

(c) providing the query access and report access to the 
design database. 
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